Compression of Compound Documents
نویسنده
چکیده
Compound (or mixed) document images contain graphic or textual content along with pictures. They are a very common form of documents, found in magazines, brochures, web-sites etc. Because of the very distinct nature of those two image classes (text/graphics vs. pictures), their compression invariably involves multiple compression systems and a region segmentation (classification) method. We review state-of-the-art technologies on the subject while focusing our attention on the mixed raster content (MRC) multi-layer approach. We also present new results on segmentation for MRC based on optimized rate-distortion-based block thresholding.
منابع مشابه
MRC Compression of Compound Documents using H.264/AVC-I
The Mixed Raster Content (MRC) ITU document compression standard (T.44) specifies a multi-layer multiresolution representation of a compound document. It is expected that higher compression can be achieved if more efficient compression standards are used to compress each layer. In this paper we present an MRC compound document codec that uses the H.264/AVC operating in INTRA mode to encode back...
متن کاملLow complexity guaranteed fit compound document compression
We propose a new, very low complexity, single-pass, algorithm for compression of continuous tone compound documents, known as GRAFIT (GuaRAnteed FIT) that can guarantee a minimum compression ratio of as much as 12:1 and even more, for all images in a single pass, while maintaining visually lossless quality when reproduced at resolution 300 dpi or more. The compression ratio is guaranteed in a s...
متن کاملComparison of H.264/Avc-Intra Technique for Compound and Natural Image Compression
Currently, the notion of paperless office is being promoted as part of eco-projects in many industries, where paper documents are converted into electronic documents. These images are termed as ‘Compound images’ and are defined as images that contain a combination of text, natural (photo) images and graphic images. The number of documents stored in electronic format is increasing enormously and...
متن کاملJPEG2000-matched MRC compression of compound documents
The Mixed Raster Content (MRC) ITU document compression standard (T.44) specifies a multilayer decomposition model for compound documents into two contone image layers and a binary mask layer for independent compression. While T.44 does not recommend any procedure for decomposition, it does specify a set of allowable layer codecs to be used after decomposition. While T.44 only allows older stan...
متن کاملLossless Compression for Compound Documents Based on Block Classification
Image and video compressions are required to reduce the number of bits needed to represent the content of the original data. Compression of scanned or compound documents and images can be more difficult than the original data because it is a mixture of text, picture and graphics. The main requirement of the compound document or images is quality of the decompressed data. Here Quality is defined...
متن کاملDocument Compression Using H.264/AVC
It has been verified that H.264/AVC, the newest video compression standard, can also be used to encode still images. In many cases, it outperforms state-of-art coders such as JPEG2000. For compound documents, the gains over JPEG2000 are even more expressive. In this scenario, the contributions of the present paper are distributed over four document encoding methods that use the H.264/AVC as a b...
متن کامل